Multi-armed bandit

Results: 113



#Item
51Multi-armed bandit / Stochastic optimization / Problem solving / Learning / Game theory / Statistics / Educational psychology / Machine learning

REV_ISS_WEB_COGS_12052_37

Add to Reading List

Source URL: www.indiana.edu

Language: English - Date: 2013-11-25 13:26:38
52Maximum likelihood / Kullback–Leibler divergence / M-estimator / Likelihood function / Reinforcement learning / Divergence / Multi-armed bandit / Marginal likelihood / Extremum estimator / Statistics / Estimation theory / Expectation–maximization algorithm

Expectation Maximization for Weakly Labeled Data Yuri Ivanov MIT Media Laboratory, 20 Ames St., E15-390, Cambridge, MA 02139, USA

Add to Reading List

Source URL: characters.media.mit.edu

Language: English - Date: 2003-06-30 17:21:48
53Multi-armed bandit / Stochastic optimization / Prospect theory / Probability / Statistics / Decision theory / Machine learning

When biases under risk are optimal under uncertainty and learning. Overestimation of low probabilities and status quo bias Michel De Lara∗ October 29, 2011

Add to Reading List

Source URL: www.parisschoolofeconomics.eu

Language: English - Date: 2012-12-19 16:58:45
54Statistical theory / Foraging / Marginal value theorem / Reinforcement / Exponential distribution / Prior probability / National Institutes of Health / Optimal foraging theory / Multi-armed bandit / Statistics / Bayesian statistics / Ecological theories

NIH Public Access Author Manuscript J Exp Psychol Anim Behav Process. Author manuscript; available in PMC 2008 December 3. NIH-PA Author Manuscript

Add to Reading List

Source URL: www.ncbi.nlm.nih.gov

Language: English
55Stochastic optimization / Dynamic programming / Albert Shiryaev / Stochastic / Optimal control / Multi-armed bandit / Optimal stopping / Mathematical finance / Applied mathematics / Statistics / Mathematical optimization / Operations research

Selected Publications: Monographs: Sequential Control with Incomplete Information: The Bayesian Approach to Many-Armed Bandit Problems, Academic Press, 1990, 266 p., English translation of: Moscow, Nauka, 1982, 256 p., (

Add to Reading List

Source URL: mse-msu.ru

Language: English - Date: 2011-04-18 03:41:09
56Stochastic optimization / Machine learning / Multi-armed bandit / Algorithm / PP / Theoretical computer science / Applied mathematics / Statistics

Competitive Collaborative Learning Baruch Awerbuch1 ? and Robert D. Kleinberg2

Add to Reading List

Source URL: www.cs.jhu.edu

Language: English - Date: 2007-10-09 11:15:35
57Markov models / Signaling game / Markov processes / Lewis signaling game / Markov chain / Reinforcement learning / Multi-armed bandit / Signaling / Game theory / Statistics / Asymmetric information

Draft JanLearning to Signal with Two Kinds of Trial and Error Brian Skyrms 1. Low Rationality Game Theory

Add to Reading List

Source URL: www.imbs.uci.edu

Language: English - Date: 2014-11-04 13:01:47
58Developmental psychology / Reinforcement learning / Multi-armed bandit / Learning / Pi / Statistics / Mathematical analysis / Markov models

Learning for Contextual Bandits Alina Beygelzimer 1 John Langford

Add to Reading List

Source URL: hunch.net

Language: English - Date: 2010-09-23 14:42:03
59Reinforcement learning / Q-learning / Multi-armed bandit / Learning automata / Reinforcement / Motivation / Markov decision process / Mountain Car / Statistics / Machine learning / Artificial intelligence

Journal of Arti cial Intelligence Research Submitted 9/95; published 5/96 Reinforcement Learning: A Survey Leslie Pack Kaelbling

Add to Reading List

Source URL: www.societyofrobots.com

Language: English - Date: 2010-01-10 09:36:37
60Learning / Stochastic optimization / Multi-armed bandit / Reinforcement learning / Supervised learning / Mathematical optimization / Statistics / Machine learning / Artificial intelligence

Large-Scale Bandit Problems and KWIK Learning Jacob Abernethy Kareem Amin Computer and Information Science, University of Pennsylvania

Add to Reading List

Source URL: jmlr.org

Language: English - Date: 2013-08-14 01:36:42
UPDATE